Combining Statistical and Rule-Based Approaches to Morphological Tagging of Czech Texts
نویسندگان
چکیده
منابع مشابه
Combining Statistical and Rule-Based Approaches to Morphological Tagging of Czech Texts
is article is an extract of the PhD thesis (Spoustová, 2007) and it extends the article (Spoustová et al., 2007). Several hybrid disambiguationmethods are describedwhich combine the strength of hand-written disambiguation rules and statistical taggers. ree different statistical taggers (HMM,Maximum-Entropy and Averaged Perceptron) and a large set of hand-written rules are used in a tagging ex...
متن کاملCombining Rule-Based and Statistical Syntactic Analyzers
This paper presents the results of a set of preliminary experiments combining two knowledge-based partial dependency analyzers with two statistical parsers, applied to the Basque Dependency Treebank. The general idea will be to apply a stacked scheme where the output of the rule-based partial parsers will be given as input to MaltParser and MST, two state of the art statistical parsers. The res...
متن کاملRule Induction: Combining Rough Set and Statistical Approaches
In this paper we propose the hybridisation of the rough set concepts and statistical learning theory. We introduce new estimators for rule accuracy and coverage, which base on the assumptions of the statistical learning theory. Then we construct classifier which uses these estimators for rule induction. These estimators allow us to select rules describing statistically significant dependencies ...
متن کاملRule-based Tagging: Morphological Tagset versus Tagset of Analytical Functions
This work presents a part of a more global study on the problem of parsing of Czech and on the knowledge extraction capabilities of the Rule-based method. It is shown that the successfulness of the Rule-based method for English and its unsuccessfulness for Czech, is not only due to the small cardinality of the English tagset (as it is usually claimed) but mainly depends on its structure (”regul...
متن کاملStatistical modality tagging from rule-based annotations and crowdsourcing
We explore training an automatic modality tagger. Modality is the attitude that a speaker might have toward an event or state. One of the main hurdles for training a linguistic tagger is gathering training data. This is particularly problematic for training a tagger for modality because modality triggers are sparse for the overwhelming majority of sentences. We investigate an approach to automa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Prague Bulletin of Mathematical Linguistics
سال: 2008
ISSN: 1804-0462,0032-6585
DOI: 10.2478/v10108-009-0002-x